1 Introducing Finite Automata 


1.1 Problems and Computation 


Decision Problems 


Decision Problems 
Given input, decide “yes” or “no” 


e Examples: Is x an even number? Is x prime? Is there a path from s to t in graph G? 


e i.e., Compute a boolean function of input 


General Computational Problem 
In contrast, typically a problem requires computing some non-boolean function, or carrying out an 
interactive/reactive computation in a distributed environment 


e Examples: Find the factors of x. Find the balance in account number z. 


e In this course, we will study decision problems because aspects of computability are captured 
by this special class of problems 


What Does a Computation Look Like? 


e Some code (a.k.a. control): the same for all instances 
e The input (a.k.a. problem instance): encoded as a string over a finite alphabet 
e As the program starts executing, some memory (a.k.a. state) 


— Includes the values of variables (and the *program counter") 
— State evolves throughout the computation 


— Often, takes more memory for larger problem instances 


e But some programs do not need larger state for larger instances! 


1.2 Finite Automata: Informal Overview 


Finite State Computation 


e Finite state: À fixed upper bound on the size of the state, independent of the size of the input 


— A sequential program with no dynamic allocation using variables that take boolean 
values (or values in a finite enumerated data type) 


— ]f t-bit state, at most 2* possible states 
e Not enough memory to hold the entire input 


— "Streaming input": automaton runs (i.e., changes state) on seeing each bit of input 


An Automatic Door 


Front Rear 
pad pad 


door 


Figure 1: Top view of Door 
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Figure 2: State diagram of controller 


e Input: A stream of events «front», «rear», «both», «neither» ... 


e Controller has a single bit of state. 


Finite Automata 
Details 


Automaton 
A finite automaton has: Finite set of states, with start/initial and accepting/final states; Transitions 
from one state to another on reading a symbol from the input. 


Computation 
Start at the initial state; in each step, read the next symbol of the input, take the transition (edge) 
labeled by that symbol to a new state. 

Acceptance/ Rejection: If after reading the input w, the machine is in a final state then w is 
accepted; otherwise w is rejected. 


Figure 3: Transition Diagram of automaton 


Conventions 


e The initial state is shown by drawing an incoming arrow into the state, with no source. 


e Final/accept states are indicated by drawing them with a double circle. 


Example: Computation 


e On input 1001, the computation is 


1. Start in state qo. Read 1 and goto q1. 

2. Read 0 and goto qj. 

3. Read 0 and goto qı. 

4. Read 1 and goto qo. Since qo is not a final state 1001 is rejected. 


e On input 010, the computation is 


1. Start in state qo. Read 0 and goto qo. 
2. Read 1 and goto qj. 
3. Read 0 and goto qı. Since qı is a final state 010 is accepted. 


1.8 Applications 


Finite Automata in Practice 


e grep 

e Thermostats 

e Coke Machines 

e Elevators 

e Train Track Switches 
e Security Properties 


e Lexical Analyzers for Parsers 


2 Formal Definitions 


2.1 Deterministic Finite Automaton 


Defining an Automaton 


To describe an automaton, we to need to specify 
e What the alphabet is, 
e What the states are, 
e What the initial state is, 
e What states are accepting/final, and 
e What the transition from each state and input symbol is. 


Thus, the above 5 things are part of the formal definition. 


Deterministic Finite Automata 
Formal Definition 


Definition 1. A deterministic finite automaton (DFA) is M = (Q, £, ô, qo, F), where 
e Q is the finite set of states 
e » is the finite alphabet 


e ô: Qx X — Q "Next-state" transition function 


do | do 41 
qı | q1 40 


Figure 5: Transition Table representation 


e qo € Q initial state 
e F C Q final/accepting states 


Given a state and a symbol, the next state is *determined". 


Formal Example of DFA 


Example 2. 
Figure 4: Transition Diagram of DFA 


Formally the automaton is M = ({qo, q1}, {0, 1}, 6, go, {a1 }) where 


ó(qo, 0) = qo ôlqo, 1) = q 
ó0(n,0)—q ó(q1, 1) = qo 


Computation 


Definition 3. For à DFA M = (Q, £, ô, qo, F), string w = w1w2-:-- wj, where for each i w; € X, 
w . . 
and states q1,q2 € Q, we say q1 —>>m qo» if there is a sequence of states ro, r1, ... rj such that 


e ro = di, 


e for each i, Ó(ri,wi,1) = ri41, and 


e rj = Q2. 


Definition 4. For a DFA M = (Q, £, ô, qo, F) and string w € X*, we say M accepts w iff qq —>m q 


for some q € F. 


Useful Notation 


Definition 5. For a DFA M = (Q, X, ô, qo, F), let us define a function ôm : Q x X* + P(Q) such 
that óy(q,w) = {qd € Q|q Sm q'}. 
We could say M accepts w iff ôm (qo, w) NF z 0. 


Proposition 6. For a DFA M = (Q,X,0,qo, F), and any q € Q, and w € X*, |óy(q, w)| = 1. 


Acceptance/Recognition 


Definition 7. The language accepted or recognized by a DFA M over alphabet © is L(M) = {w € 
&* | M accepts w}. A language L is said to be accepted/recognized by M if L = L(M). 


2.2 Examples 


Example I 
0,1 
Figure 6: Automaton accepts all strings of 0s and 1s 
Example II 


Figure 7: Automaton accepts strings ending in 1 


Example III 


Figure 8: Automaton accepts strings having an odd number of 1s 


Example IV 
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Figure 9: Automaton accepts strings having an odd number of 1s and odd number of 0s 


3 Designing DFAs 


3.1 General Method 
Typical Problem 


Problem 
Given a language L, design a DFA M that accepts L, i.e., L(M) = L. 


Methodology 


e Imagine yourself in the place of the machine, reading symbols of the input, and trying to 
determine if it should be accepted. 


e Remember at any point you have only seen a part of the input, and you don't know when it 
ends. 


e Figure out what to keep in memory. It cannot be all the symbols seen so far: it must fit into 
a finite number of bits. 


3.2 Examples 


Strings containing 0 


Problem 
Design an automaton that accepts all strings over (0, 1) that contain at least one 0. 


Solution 
What do you need to remember? Whether you have seen a 0 so far or not! 


Figure 10: Automaton accepting strings with at least one 0. 


Even length strings 


Problem 
Design an automaton that accepts all strings over (0, 1) that have an even length. 


Solution 
What do you need to remember? Whether you have seen an odd or an even number of symbols. 


Figure 11: Automaton accepting strings of even length. 


Pattern Recognition 


Problem 
Design an automaton that accepts all strings over {0,1} that have 001 as a substring, where u is a 
substring of w if there are wı and w» such that w = wyuwe. 


Solution 
What do you need to remember? Whether you 


e haven't seen any symbols of the pattern 


e have just seen 0 
e have just seen 00 


e have seen the entire pattern 001 


Pattern Recognition Automaton 


I 0 0,1 
ES EO 


Figure 12: Automaton accepting strings having 001 as substring. 


grep Problem 


Problem 
Given text T and string s, does s appear in T? 


Naive Solution 


—s? 
=s? 
=s? 
=s? 
=s? 


iN 
TUI Iq arcum 


Running time — O(nt), where |T| — t and |s| 2 n. 


grep Problem 
Smarter Solution 


Solution 
e Build DFA M for L = {w| there are u,v s.t. w = usv} 
e Run M on text T 

Time = time to build M + O(t)! 


Questions 


e Is L regular no matter what s is? 
e If yes, can M be built “efficiently”? 


Knuth-Morris-Pratt (1977): Yes to both the above questions. 


Multiples 


Problem 
Design an automaton that accepts all strings w over (0, 1) such that w is the binary representation 
of a number that is a multiple of 5. 


Solution 
What must be remembered? 'The remainder when divided by 5. 
How do you compute remainders? 


e If w is the number n then w0 is 2n and wl is 2n +1. 
e (a.b +c) mod 5 = (a.(b mod 5) +c) mod 5 


e e.g. 1011 = 11 (decimal) = 1 mod 5 10110 = 22 (decimal) = 2 mod 5 10111 = 23 (decimal) 
= 3 mod 5 


Automaton for recognizing Multiples 


Figure 13: Automaton recognizing binary numbers that are multiples of 5. 


A One k-positions from end 


Problem 


10 


Design an automaton for the language Lj, = {w | kth character from end of w is 1} 


Solution 
What do you need to remember? The last k characters seen so far! 
Formally, Mi, = (Q, (0,11,6, qo, F`) 


e States = Q = {(w) | w € {0,1}* and |w| € k} 


_ f (wb) if |w| < k 


* qo = (6 
e F= [(1uows ... wy) | Wi E {0, 1}} 
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